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04 . Abstract. We solve the dynamic Predecessor Problem with high probability (whp) in constant time, 

' using only n^^'^ bits of memory, for any constant S > 0. The input keys are random wrt a wider 

clciss of the well studied and practically important class of (/i, /2)-smooth distributions introduced 

' in [3]. It achieves 0(1) whp amortized time. Its worst-case time is 0{-^ TogU^gTi')- ^l^o, we prove whp 

O (log log log n) time using only n^^i°iT°i~" — n^+°'^' bits. Finally, we show whp O(loglogn) time using 
0{n) space. 

Keywords: dynamic data structure, predecessor search, random input, probabilistic analysis. 

Q 

^ • 1 Introduction 
o 

The problem. Suppose that X is a dynamic0 file of n keys each represented by £ < 6 bits 
of memory, drawn from a universe U consisting of |?7| = 2^ possible words, in a unit-cost 
■ RAM with word length b = log \ U\. These i bits are needed for representing all n keys of 
^ . X, so, £ must satisfy log \ X\ = \ogn < i < b = log \ U\. The goal is to wisely (preprocess 
Tj- ! X) and efficiently store (total bits for the n keys of file X to be as close to n, while the i 
Tij- I bits per key in X to be as close to the minimum amount logn) file X in a data structure 
O I in a way that important queries are performed in the most efficient query time (as close to 
0(1)). "Time" refers to the number of probed (compared) memory words until the query is 
complete. Perhaps the most basic is the Membership Memb(x,X) query: determine if x G X, 
for an arbitrary x eU. The T)el{y,X) query: delete y E X is reduced to Memb(?/,X) in 0(1) 
time. Extensively studied [5] and more complex to perform is the Predecessor Pred(x,X) 
query: determine the largest y E X such that y < x, for an arbitrary x E U. Finally, the 
Insrt(?/,X) query inserts y eU into file X. 

Complexity. It is known that membership queries on X require 0(1) time and linear 
space 0(n) [8J and [20j via exploiting the power of double hashing. However, searching for 
predecessors is substantially more complex, as manifested by an early ground breaking result 
of Ajtai [Ij proving the impossibility of achieving constant query time using only polynomial 
wrt n = \X\ space. In more words, Ajtai proved that if the memory bits per word are 
i = O(logn), that is, only "short" words are available, and if only n*^^^-* such short words 
can be used for representing a set |X| = n, then, it is impossible to perform predecessor 
search in constant time. In sharp contrast, if "long" words of length i > n are available. 
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It supports updates as insertions of new and deletions of existing keys, otherwise X is static. 



Ajtai, Fredman, and Komlos [2] proved that only 0{n) "'long' words suffice for finding 
predecessors in constant time on |X| = ri. A flurry of subsequent papers [T9|5|21f23] were 
motivated and built on the work of [Ij. The result [2] when combined with [T], illustrates 
how crucially limits the query performance the total available space when measured by the 
overall bits. Thus, it is important to assume that the £ bits per key of file X are very small 
wrt file size n, otherwise the space requirement would be huge. Quoting from [5, Sect. 2] 
^\..The most interesting data structures are those that work for an arbitrary universe whose 
elements can fit in a single word of memory (i.e. \U\ < 2^) and use a number of words that 
is polynomial in n, or ideally 0{n) 

Motivation. A key ingredient of the approaches in [T][T9f5|2T|23] was to exhibit a "bad" 
distribution of inputs that, when combined with an also "bad" distribution of queries, "ill 
advised" the data structure to high query time. Notably, such bad performance was un- 
avoidable even by allowing the (randomized) data structure to take random decisions [23] of 
rejecting excessively/unconveniently long queries. But, at this point an important question, 
that highly motivates our work, arises: is it possible to circumvent these lower bounds by 
just considering more "natural" and as general as possible real world distributions of in- 
put keys? As Frieze and Reed suggest [H], instead of analyzing the worst-case (but rare) 
performance of algorithms, it is often more telling to analyze their performance on typical 
(highly probable) instances. Clearly, a natural goal is the algorithm to remain fast on the 
overwhelming majority of random inputs. Perhaps, the idea of exposing information from 
uniform input distribution as to facilitate the performance of a data structure goes back at 
least to Peterson's method of Interpolation Search (IS) [21]: recursively drive each probe for 
locating Pred(?/,X) towards to (an estimation of) the expected location of y in file X. He 
announced 0{n) such real (but "long") words suffice to find predecessors in order of log log n 
expected time. Even for the restricted case of uniform (or, for transforming to uniform an 
arbitrary but known) continuous distribution, the analysis of IS attracted a lot of inter- 
est |12lll4ll25|26|27ll29j . In Knuth's infamous list of top searching problems, it was placed as 
1st the '^Average case of IS", emphasizing the importance of the typical (most likely) data 
structure performance on (most general) random input distributions, see Sedgewick's list 
[28] . However, uniform input reals is a quite restrictive (strong) assumption for real world 
applications, since these are not easy to produce or handle in all practical applications. 
Willard [32] made a breakthrough by introducing unknown regular input distributions on 
reals (once more). In words, the probability density can fluctuate over a given range, but, its 
ffist derivative remains bounded, thus, bounding the estimated location error. Mehlhorn and 
Tsakalidis [18] studied the dynamic version for the larger class: unknown {\n°'], y/n) -smooth 
continuous distributions, permitting more control to the distribution of the real keys in each 
real subinterval. Intuitively, now 1st derivative may be unbounded. This class contains the 
class regular distributions, notably regulars correspond to ([n"'], 0(l))-smooth ones. Finally, 
the state of the art of the most general unknown input distributions are the (/i, /2)-smooth 
ones (Def. [21 in App. [A]) considered by Andersson and Mattson [3J, for even more general 
/ij/2 parameters. Recent advances in database theory and applications [13] indicate the 
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importance of IS variants on boosting the search performance on the most general random 
(skewed) input distributions. 

However, all these and many other IS based approaches (variants) p[TMl[TnTg]|2gll25]l2i]l52]|2^ 
raised the following two crucial concerns about their space and time efficiency. First, concern- 
ing space, in order to facilitate their probabilistic analysis, all these approaches [3|12|14|15fl8|26|25f24|32|29l 
have a common suffering assumption. They are limited to random real keys of infinite word 
length, thus, it is questionable how to bound the total bits of space by a polynomial of 
n = \X\. Frankly, it is not obvious to store n real numbers in just 0{n) bits while achieving 
fast query time. More on this, although these approaches require 0{n) real (but "long") 
words of memory, it is quite telling to recall our discussion in the previous paragraph, about 
how the predecessor time is boosted to a constant by just conveniently assuming "long" 
words of memory in p], opposed to the corresponding superconstant lower bound proved 
in [Tj by assuming "short" ones. Also, real keys do have theoretical significance, but, in all 
applications it is highly not trivial to produce or handle. Finally, concerning running time, 
these approaches P|12lll4fl5fl8ll26ll25f24ll32f29j cannot run beyond the 6* (log log n) expected 
time for so general as the smooth input distributions in [3]. 

Our result. Under the most natural and even wider assumptions about the random in- 
put keys that have been considered up to today [3], we manage to alleviate all lower 
bounds for the dynamic predecessor search problem p!1[T9|5ll2T1[23] . by proving constant 
time with high probability (whp), as n grows large, thus improving over all approaches in 
Pll2fl4|15lll8ll26|25|24|32|29] that achieved 6* (log log n) expected running time for such wide 
random class of inputs. The fine details of our dynamic data structure exhibit that it whp 
achieves constant predecessor time working with only 0{n) "short" memory words of length 
at most O(logn), just consuming an overall of n^~^^ number of bits, for any constant 6 > 0. 
Also, our structure achieves O (log log log n) whp query time using only n"^iogiogn = 7^1+0(1) 
space, which is an exponential improvement over the pervious IS results. Finally, we show 
whp O (log log n) time using 0{n) space. In this work we mainly demonstrated the asymp- 
totic efficiency of the data structure. The tuning of constants for practical purposes is a 
matter of further work. 

One could claim that just a simple trie with fan-out yields 0(1/5) time and linear 
space. But note that this cannot (at least to the best of our knowledge) implement predecessor 
search in constant time. The reason is that, for the widest class of random input studied 
up to now, we managed to reduce whp the arbitrary universe size \U\ to the significantly 
smaller size 2*^^°^", but, constant C is > 1. Hence the lower bounds in [21] rule out any hope 
for linear space. In fact, our way (Sect. 14. 7p can be viewed as an efficient VEB trie variant 
of |22l Lem. 17], which improves time to 0(log(l/5)). 

Theorem [1] below depicts the performance characteristics of our dynamic data structure 
(DS). These stem from the combination of many properties, formally proved in Section HI It 
is more helpful to visit Section 12] first, illustrating the basic parts of DS. It also presents the 
underlying intuition that governs our probabilistic analysis. Also, it suggests a way to follow 
the line of detailed proofs in Section HI 
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Theorem 1. DS initially stores a file X of n keys drawn from an unknown {n'^ ,n°')- smooth 
inpu^ distribution P, with < a < 1 and constant 7 > 0. For any positive constant 5 > it 
has the following properties: 

1. Requires 0{n^^^) space and time to built. 

2. Supports f2{n-^^^) updates as: Pred(?/,X), Del(?/,X), Insrt(y,X), Memb(?/,X). We as- 
sume that Insrt(y,X) and Del(?/,X) queries can occur in arbitrary order, only satisfying 
that 0{n) keys are stored per update step, while the rest queries are unrestricted. 

3. Supports 0(1) query time with high probability (whp). 

4. Supports O(logloglogn) query time whp, using only n^^^°^ i°g" = n^^"^^'^ space (built time). 

5. Supports 0(loglogr;,) query time whp, using 0{n) space (built time). 

6. Supports 0(1) whp amortized update time. 

7. Supports 0(y^j^^^) worst-case query time. 

Proof. 1. Sect. 13.11 describes how DS is built. Sect. 14.71 proves in Th. [5] the required time 
and space to construct the upper static DS part. Finally, Sect. 14.81 proves in Corol. |2] the 
corresponding time and space required by the lower dynamic DS part. 

2. Sect. 14.3] describes in Property [2] the sufficient conditions for performing Q{n^^^) updates 
on the lower dynamic DS part. 

3. When handed an arbitrary query, we ffist proceed this query to the upper static DS part, 
as Sect. 14.71 describes. Therein Theorem [5] proves that the handed query is driven whp in 
0(1) time to the lower dynamic DS part. Finally, Sect. 14.81 describes how the handed query 
is implemented and Corollary [2] proves that query is whp answered in 0(1) time. 

4. 5. These are proved in Corollary [TJ 

6. Sect. 14.91 proves the 0(1) whp amortized time. 

7. Sect. 14.101 proves the T^^^) worst-case time. 

2 Related work, smooth input, definitions 

Related word. The classical Van Emde Boas data structure [B] achieves 0(log£) query time 
in exponential space, but the space can be reduced to linear when hashing is used [21] . Fred- 
man and Willard [T0ll9] proposed the Fusion tree which achieves linear space with 0(^j) 
query time. When combining the fusion and Van Emde Boas bounds one gets a query time 
bound 0(min(log£, -y/Iogn)). Capitalizing on this, Beame and Fich [5] were the ffist to dive 
below the van Emde Boas/Fusion tree barrier , that is they improved the query time to 

0(min(jj^jSl_^ yC^^Z)) while consuming quadratic in n space. The exponential tree jl] is a 
generic transformation which transforms any static polynomial space predecessor data struc- 
ture with polynomial construction time into a dynamic linear space predecessor data struc- 
ture with efficient updates. Combining the Beame and Fich data structure with exponential 

search tree produces a data structure with linear space and 0(min(loglogn- \J'^^^^)) 
query/update time. Finally Patrascu and Thorup [21] explore in depth every fine detail of 

^ This is superset from the class considered in [3], see details in Def . [2] & [1] and in Sect. 14.41 
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the time versus space interplay on a static set of n keys of i bits each. In particular they 
have shown that for £ = O(logn), that is for the case of a polynomial universe \U\ = 2'^(^°s")^ 
constant query time is possible and the optimal space is 0{n^^^), where 6 is any positive con- 
stant. In our case, \U\ can be arbitrarily larger than ^21j above, but we managed to reduce it 
to a polynomial one, via exploiting the smoothness of the random input. Constant expected 
search time was proved for IS in [3l Cor. 12] for random real input keys of a bounded prob- 
ability density, formally for the restricted class of (/i = n, f2 = l)-smooth densities (Def. [2] 
below). The work in [TJ] removed the dependence on n for the expected search time by using 
finger search and obtaining 0{\oglogd) time, with d the distance of the targed key from the 
key pointed by the finger. The work in [16] is the first that extends the analysis of IS to 
discrete keys. The work [7] removes the independence of input keys. We give the definition 
of smoothness for discrete input (see Def. [2] in App. El for the continuous case). 

Definition 1 ([16j). An unknown discrete probability distribution P over the discrete keys 
xi, . . . ,xn of the universe [a, b] is (/i, f2)-smootl^, if the exists a constant 13 such that for all 
constants a < ci < C2 < C3 < 6 and integers n, for the conditional probability of a ^-random 
key y to hold: 



C3 — Ci 

C2 <y < C2 Ci < y < C3 



£ P[ci, C3](x.)</3^'^'' 



n 



where P[ci,C3](xj) = for < Ci or Xi > C3, and P[ci, C3](xj) = P(xj)/p for Xi G [ci, C3] 



3 Data structure, underlying probabilistic intuition, proof plan 

3.1 Implementation outline 

DS operates in two phases. The preprocessing phase samples n random input keys and fine 
tunes some DS characteristics. The operational phase consists of Q{n}^^) update steps, each 
taking 0(1) time whp. 

Preprocessing phase: n random keys from the universe [/ = {0, . . . , 2''''} are inserted to 
the dynamic file X, drawn wrt an unknown (/i, /2)-smooth distribution (Def. [1]). We order 
increasingly the keys in X as: 

X = {xi,...,Xn} (1) 
We select p — 1 < n representative keys (order statistics): 

7^ = {ro,...,r^} (2) 

^ Intuitively, function /i partitions an arbitrary subinterval [01,03] C [a,b] into /i equal parts, each of length 
'^^J^^ = 0{j^); that is, /i measures how fine is the partitioning of an arbitrary subinterval. Function /2 guarantees 
that no part, of the /i possible, gets more probability mass than that is, /2 measures the sparseness of any 
subinterval [c2 — ^^j^,C2] C [ci,C3]. The class of (/i, /2)-smooth distributions (for appropriate choices of /i and 
/2) is a superset of both regular and uniform classes of distributions, as well as of several non-uniform classes 
[3118) . Actually, any probability distribution is (/i, ©(n))-smooth, for a suitable choice of /?. 
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out of the ordered file A" in ([T]): for < i < p (bucket) representative key G 7^ in (|2]) is the 
kej0 Xia(n)n G A' in ([T]), with tq = 0,rp = 2*. We store an encoding (indexing) of TZ into a 
VEB data structure (Sect. ST]), which consists the static upper part (sDS) of our DS (App. [B| 
Rem. [1]). Each pair of consecutive keys ri,ri+i G TZ depicted in ([2]) points to the ith bucket 
with endpoints [r^, rj+i), for each i G [p — 1]. In turn, the ith such bucket is implemented as 
g*-heap [30]. These g*-heaps consist the dynamic lower part (dDS) of our DS (App. [B| Rem. 
[2]), which can be accessed (Sect. 14. 8p by any query wrt an arbitrary key y E U in 0(1) time 
only through the upper static sDS part of our DS. 

Properties: Observe that sDS may takes 0{n^~^^) built time, but this is absorbed by the 
n{n^~^^) update steps of the operational phase (App. [B], Rem. [1]), thus safely inducing low 
whp amortized cost. By a probabilistic argument (Th. H]), each representative ri E TZ whp 
can be uniquely encoded with Cilogn bits, Ci = 0(1), yielding overall sDS space 0{n^^^) 
for any constant S > 0, while performing PTed{y,7Z) in 0(1) time for any key y E U (Sect. 

m- 

Operational phase: consists of f]{n^^^) queries (App.[B| Rem.[T]) as: Pred{y, X), Del{y, X), 
Insrt(?/,X), Memb(?/,X) (Sect. [1]). Each query wrt key y E U is landed in 0(1) time by the 
static sDS of TZ defined in ([2]) towards to the appropriate dynamic g*-heap pointed by the 
Pred(?/,7^) = iyth bucket. 

Properties: The only restriction is that Insrt(y,X) and Del{y,X) queries can occur in 
an arbitrary (round-robin) round but must satisfy that 0{n) keys (Property [2]) are stored 
per update step, while the rest queries are unrestricted. Since the Pred(?/, TZ) = iyth such 
landed bucket is implemented as g*-heap, each update wrt y E U takes 0(1) time by the 
corresponding g*-heap built-in function. The only concern (App. |Bl Rem. [3]) is to show that 
the iyth bucket load whp remains O(logn) per update step in file X (Th. [2] and Lem. |3]). 

3.2 Bird's eye view of the idea 

Uniform "toy" input. For a toy warmup, let us assume the uniform input distribution 
over U. We induce a partition of the universe U into p < n equal sized parts or buckets, 
in the sense that each bucket lies on equal number of keys from U. More precisely, let TZ 
in ([2]) now deterministically partition as [ro,ri) U [ri,r2) U . . . U [rp-i,rp] the universe U, 
with ro = 0, Tp = 2* = \U\ and each part size is |[rj,rj+i)| = ^,Vi E [p — 1]. Now, observe 
that for any target key y E U, we can locate within 0(1) time the bucket iy = [rj^, rj^+i) 
that y lies, in symbols r^^ < y < Tj^+i. Just benefit from the fact that all the buckets are 
equally sized and divide y by the bucket size This yields bucket iy = [y/^-^\- At this 
point, querying Pred{y, X) in the original file has been reduced in 0(1) time to querying 
PTed{y,Xn[riy,riy^i)) in bucket iy. So it remains to compute Pred(|/, Xfl [r^^, rj^^+i)) in 0(1) 
time. This is achieved by implementing each bucket z as a g*-heap [30], provided that each 
bucket i load |Xn [r^, rj+i)| whp remains O(logn) per update step, Vz G [p — 1]. By choosing 

^ We defer the (technical) tuning of the value a(n)n = ©(logn) in the proof of Th.[3l Intuitively, the keys in TZ are 
the Qf(n)n-apart keys in X, except from ro = and rp — 2*", being the endpoints of the universe U. 
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P ~ logn folklore that each bucket i = [rj,rj+i) whp contains load \X fl [rj,rj+i)| = 
©(logn) from a dynamic random file X G U sized n. 

/i-random input. Let us now generalize the above toy, but illustrative, assumption of 
uniform input. For each key x E U, assume an arbitrary known probability F[x] that key x 
is drawn to be inserted into file X. It is easy to exploit F[x] and partition the universe into 
(possibly p » n) buckets [rj, rj+i) so that whp each bucket load |Xn [rj, rj_|_i)| whp remains 
0{\.ogn). Just compute for each bucket i its endpoints r^, rj+i that satisfy Y^^x=ri ^[^] ~ ^^f^- 
Whp this imposes tight 0{\ogn) bucket load bound per update step, as required for the 
g*-heaps implementing the buckets to remain 0(1) efficient. 

However, now it is not so easy (as the division [y/^J above for the toy example of 
uniform input) to locate the bucket iy = [riy,riy^i) that query y lies in. Because now each 
bucket i has its own size |[rj,rj+i)| possibly 7^ The subtle reason is that the bucket size 
|[ri,rj+i)| depends on the accumulated probability mass ^^^^^ ■'^'l^] keys x in the 

bucket [rj,rj+i). Intuitively, a high valued probability function F[x] for the x's in [rj,ri+i) 
would induce very narrow endpoints rj,rj+i, while a low valued one would significantly 
widen the corresponding bucket interval. Here our rescue comes from storing all the bucket 
endpoints 71 in into the sDS which essentially is a VEB structure exactly as described in 
[221 Lem. 17]. Similarly as above, to determine the bucket [r^^, rj^^+i) that y lies in, it reduces 
to compute the predecessor of y in the sDS of 71, which takes 0(1) time. 

But, it remains the important concern of total sDS space to be explained. It is critical to 
cheaply encode each bucket endpoint G 7^ in ([2]) by consuming the fewest bits. Roughly, 
the bits per ri E 7Z must not exceed O(logn), otherwise the total sDS space would be too 
excessive. Observe that the required bits per endpoint G 7Z, as well as the total number p 
of endpoints in 71, are critically related to the smoothness of distribution function P (Def. [1]). 
Once more, if P is too "picky", inducing some dense bucket [rj,rj+i), then the corresponding 
endpoints rj,rj+i would be too close to each other. Thus to encode each such endpoint 
(amongst possibly p » n many ones) would sacrifice a;(log?7,) bits, yielding an overall space 
explosion of the sDS. We prove probabilistically (Th. |4]) that such a bad scenario of highly 
dense endpoints is quite unlikely when considering any smooth input distribution in the class 
of [3]. In this way, the total space is whp 0{n^~^^) for any constant 6 > 0. 

Unknown smooth input as in [3| . The morals discussed above will help us to tackle our 
final unresolved issue. As it is common in real world applications, here the input probability 
P is completely unknown to us, except that, P belongs to the class (Def. [1]) of smooth discrete 
distributions, which are more general (as Eq. (jl]) describes) than the ones previously studied 
in [3]. This causes a great obstacle for maintaining whp O(logn) load per bucket being 
implemented as a g*-heap: now p is unknown, so it is impossible to compute consecutive 
buckets [rj,ri+i) as to conveniently hold ^^^^i-l^f^] ~ ^^fr bucket i. 

A random b2b game will help us to identify some "well behaved" endpoints 7^ in ([2]) 
guaranteeing that each of the p—1 consecutive buckets accumulates O (^^f^) probability mass 
(Th. [2] and Lemma [3]). Notice that here the endpoints in 7Z are obtained probabilistically, 
by exploiting the strong tail bounds induced by a random game, a striking difference from 
the above examples, where 7Z was obtained deterministically by computing the summand 
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of probability P per bucket. Intuitively, at the initialization phase of our data structure, 
the set X is formed by n random inserted keys. Consider an arbitrary bucket I G U that 
(during the preprocessing phase. Sect. 13. ip contains load |X fl /| = 0(\ogn) from X. By 
the strong Binomial tail bounds, the probability that bucket / gets 0{logn) load, assuming 
that its probability mass Xli^g/^f^] '^^^^ the keys in / was either o (^^^) or uj (^^^), is 
exponentially small (Th. |3]). In words, the bad event that bucket I gets load that deviates 
too much its probability mass times n, is extremely unlikely. The possible consecutive such 
buckets I <Z U containing load |X fl /| = 0{\ogn) from X are bounded by n. It follows 
by the union bound that whp no bad event occurs at any such consecutive bucket. Also, it 
follows that this will be true during each of the subsequent rn update operations on X, with 
r = 0(1). 

3.3 The proof plan 

Step 1. In Section |4?T] we define a partition V of the universe U into \V\ = n'^i='^(^) equal 
sized parts, that is each part lies on the same number d-p of elements of U. Thus, each key 
y E U can be encoded into a single part of V, by applying a simple division using at 
most log (I PI) = Ci logn bits. In words, "P is a "cheap" way to index the keys of the universe 
U with at most Ci log n bits, which are enough for our "high probability" purposes. 
Step 2. In Sections [4. 4[ 14. 5l and l4.6l the goal is to show that whp V uniquely encodes (indexes) 
each representative key G 7^ defined in ([2]), when a{n)n = C4logn with C4 = 6>(1). It 
suffices to show that whp in each part of V it is possible to lie at most one element from 
TZ, in symbols, Vp G P it holds |p fl < 1, which is proved in Theorem HI an immediate 
consequence of Theorem [2] and Lemma [31 In particular. Theorem [2] shows that each (index) 
part p E V whp contains very few keys (say |p fl X| < Cplogn) from the dynamic file 
X (so from 7?. C X as well). In turn. Lemma [3] shows that whp each pair of consecutive 
keys rj,rj+i G 71 are sufficiently away or apart, in symbols |[rj,rj+i) fl X| > Cslogn, for a 
sufficiently large constant C3 > C-p. We conclude that Theorem H] is a consequence of the 
fact that too many keys from the dynamic file X lie between any pair of consecutive but 
"wide enough" endpoints rj,rj_|_i G 71, ruhng out -in turn- any hope for rj,rj_|_i to lie in the 
same "narrow enough" part p G P. 

Step 3. In Section 1121 we index by V an arbitrary query y E U and compute in 0(1) time 
its predecessor G 7Z using n^~^^ space, for any constant 6 > 0. We show this by uniquely 
indexing by P each G 7?. within Ci log n bits and store the so- indexed 7Z into our sDS (a 
static VEB as [221 Lem. 17]). 

Step 4. Finally, when handed y's predecessor G 7Z (by Step 3 above) in 0(1) time, in 
Section I^^Sl we compute in 0(1) time y's predecessor in the dynamic file X, which is the goal 
of this paper. This is achieved in 0(1) time in the corresponding g*-heap that implements 
the bucket of G 7Z, provided that O(logn) elements from X are stored in this bucket (Th. 
[Hand Lemma [2]). 

Step 4. Finally, the corresponding amortized times are proved in Sections 14.91 and 14.101 
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4 The proof details 



4.1 A n'^i=oW partition V of the universe U = [0, . . . , 2''] 

We show that each representative key in TZ is uniquely (and cheaply wrt bits) indexed by 
a partition V of universe U, in the sense that any two distinct keys of TZ are mapped via 
0(logr;,) bits to distinct parts of V. Let a partition V of the universe U = [0, . . . , 21^'] into 
\V\ = n^^^^^^^ equal sized parts, that is, each part lies on the same number d-p of elements 
of U. Constant Ci depends only on the smoothness (Def. [T]and (jlj)) parameters /i, f2- Each 
key y E U can be mapped into a single part of V, by applying a simple division P is a 
"cheap" way to index each key in U within log {\V\) = Ci logra bits, Ci = 0(1). The goal is 
to show that V uniquely indexes the representative keys of 71 by combining subsections 14.41 
and SSI 



4.2 Randomness invariance wrt the parts G P per update step 

The n random keys inserted at the preprocessing phase (Sect. 13. ip of n steps and each 
inserted key during the operational phase of i7(n^"'"^) steps, are drawn wrt an unknown 
but smooth (Def. [1]) input distribution P over U. Furthermore, each deleted key during the 
operational phase is selected uniformly at random from the currently stored keys. In Section 
14.11 we defined the deterministic partition V of the universe U = [0, . . . , 2^] into equal parts. 
Note that V is deterministic, thus, no part G P is constructed or biased by exploiting the 
random sample. That is, no information has been exposed from any observed random key 
as to facilitate (the length or position) of any part of V. From such P-random insertions and 
uniform deletions the work in [T7] implies that: 

Property 1 During each step of the preprocessing and operational phase the keys stored are 
^-randomly distributed per deterministic part ofV. 



4.3 0{n) keys are stored per update step of the operational phase 

Recall that during the operational phase (Sect. 13.11) queries Insrt(?/, X) and Del(?/, X) occur 
in an arbitrary (round robin) order and frequency, but must satisfy that overall 0{n) keys 
are stored in the dynamic file X per update step. On the other hand queries Pred{y, X), 
Kerah{y, X) are unrestricted. We conclude: 

Property 2 During each step t = 1, . . . , f^{n^^^), of the operational phase, the number 
of the keys stored in file X satisfy n < Cmin^^ < nt < C^g^^n, with n the number of keys in 
X at the end of preprocessing, and Cmin, Cmax constants. 



4. 4 Whp each part of P contains < Cp logn keys from file X, with C^p = 0(1) 

The most general class of unknown (/i, /2)-smooth input distributions, so that 0(loglog?7,) 
expected search times is achievable by IS, was defined in [3] with /i, /2 parameters as: 

fi{n) = - — 1+- and /2(n) = (3) 

log logn 
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with constants < a < 1 and e > 0. However, in our case, we will define an even more 
general class of smooth distributions: 

/i(n) =n^ and /2(n) =n" (4) 

with constants < a < 1 and 7 > 0. 

Theorem 2. Let n keys being inserted to file X at the Preprocessing phase, obeying an 
unknown {fi, f 2) -smooth distribution from the general class considered in We can com- 
pute constant Ci = Ci(/i,/2), such that if \V\ = n'^^ whp each part p e V contains 
\p n X\ < C-p log n random input keys during each update operation of the Operational 
phase (Sect. EJP, with = 0(1). 

Proof. See App. O 

4.5 Whp the keys in TZ are > C3 log n apart, with constant C3 > C-p 

Preprocessing (Sect. 13. ip partitions U into p parts of 0(^^^) probability mass. 

Recall from the preprocessing phase (Sect. 13. ip the ordered file A* in ([1]) and the a;(n)n-apart 
representative keys in 71 defined in ([2]). We set a{n)n = C4logn, with C4 = 0{1), which 
implies that key e 7^ in is the key = 2;(i+i)C4 logn £ in In this way, 

the p = [n/(C4logn)J keys in 71 induce a partition of the universe U. 

Observe that, opposed to partition P (Sect. 14. ip . the parts of are not necessarily 
equal. Also, opposed to partition "P, the endpoints of each part of T^-ji are not deterministic, 
instead these are sampled random keys wrt the unknown input distribution P. As a negative 
consequence. Property [T] is not further extendable to the corresponding parts of T^ti, unless 
distribution P was continuous. 

Theorem 3. Whp all parts ofT^n are spread on corresponding subintervals of the universe 
U with probability mass 0{a{n)) = 6> ( " ) where C4 = 0{1). 

Proof. See App. [Dl 

Operational phase: insert/delete balls into 0(logn) loaded bins. Recall (Sect. H75l) 
the partition T^-ji, induced by the keys in 7Z defined in ([2]) at the preprocessing phase (Sect. 
13. ip . We interpret each V-ji part as a bin and each key in file X as a ball. Theorem |3] implies 
that during the subsequent operational phase (Sect. 13. ip each new inserted ball into X will 
land to any given bin with probability ( '""^ " ) . The existing n balls, being inserted into 
file X until the end of operational phase, are considered as old. Property [2] implies that per 
update step t = 1, . . . , i7(n^+''), there are at most Ut < C^^^^n new balls thrown into the p 
bins. Thus the load of each bin, wrt to the new balls, is governed by a Binomial random 
variable with 6'(C4logn) expectation. Its folklore to use its strong tail bounds and prove 
(each bin also contains C4logn old balls) that: 

Lemma 1. We can compute C* = 0{1) > C4 : whp no bin induced by T^-jz gets load > 
C* logn. 
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It remains to show that per update step t = 1, . . . , f2{n^^^), whp there are at least C-pn 
new/old balls into each of the p bins. Trying to work as in the proof of the upper bound in 
Lemma [H a technical difficulty is that the distribution of the n old balls is not the same as 
the new ones. A way out is to remove all the n old balls (their sole purpose was to expose 
information from P and construct partition V-r, in a way that Th. [3] holds) and insert agaiij^ 
n P-random balls. However, at the end of these n random insertions, each V-ji part does not 
contain exactly C4logn balls, rather it is expected to contain 6'(C4 logra) balls. But now it 
is beneficial for the analysis that in each update step t, all the rit > n (Property [2]) existing 
balls are identically distributed into the p bins, obtaining the following lemma. 

Lemma 2. Per update step t = 1, . . . , il{n^'^^), each of the rit halls currently stored into file 
X appears into an arbitrary bin with probability 6>( '"'^|^^" ). 

An immediate consequence of lemma [2] is the following: 

Lemma 3. We can compute C-p^ = 0(1) with C-p < C-p^ < C4 : whp no bin induced by 
partition V-ji gets load < Cp^ \ogn. 

4.6 Whp P uniquely indexes each rj G 7?- with Ci log n bits, Ci = 0(1) 

Theorem 4. At the Preprocessing phase (Sect. \3. 1\) let n input input keys, drawn from 
an unknown {f f 2) -smooth distribution, to be inserted into file X. Then we can compute 
constant Ci = C(/i, /2,p) such that \V\ = nP'^ such that, during each update operation of the 
Operational phase (Sect. \3.1\) . each part p E V whp contains at most one key: |p fl < 1 
from the set TZ = {ri, r2, . . . , Vp} of the representatives of X . 

Proof. It is immediate by combining Theorem [2] (each part p E V contains < Cp log n keys 
from file X) and Lemma [3] (each pair of r^, rj G 7^ is at least C3 logn apart, with constant 
C3 > Cp). 

4.7 Storing IZ into a 0(1) time predecessor static data structure using 
0(n^+^) space, VS > 

In Sect. 14.11 we showed that each representative key in TZ is uniquely (and cheaply wrt bits) 
indexed by a partition V of universe U, in the sense of Theorem HI any two distinct keys of 
TZ are mapped via Ci logn bits to distinct parts of V. Let TZ this indexing of 7Z by partition 
V and store 7Z into a VEB data structure. For any target key y E U, indexed as y by V, 
we want to locate in 0(1) time the unique bucket iy = [rj-, rj^+i) that y lies, in symbols 
<y < which is identified by its bucket endpoints rj-,rj-_|_i G 7Z. It is obvious that 

Tj- = Pred(y, TZ) and below we take advantage of a VEB variant as described in [22l Lem. 
17]. 

Theorem 5. [22| Lem. 17] Let k, be any positive integer. Given any set 5* C [0,m — 1] of n 
keys, we can build in 0{n2'^logm) time a static data structure which occupies 0(n2'*logm) 
bits of space and solves the static predecessor search problem in 0(log( '°^ m-iogn ^^ time per 
query. 



It only induces an 0(n) time overhead to the preprocessing described in Sect. 13.1 
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In our case and terminology, we map the characteristic function of the set 7^ to a bitstring 
S with length \S\ = m that equals the number of possible values of the keys stored in TZ. To 
figure out how large is, recall from Sect. 14. II that each key e 7?. is uniquely indexed by V 
with Cilogn bits. Hence the number m of possible values of the keys in TZ are \S\ = m = 
2Ciiogn_ rpj^^g setting k = 6 we can build the predecessor data structure built on the TZ 
takes 0{n^+^) space and answers to queries in time 0(log( '°g"^~'°g" )) = 0{^^) = 0(1). 

Corollary 1. Setting k = Slogn, the predecessor data structure built on the TZ takes 0{n^^^) 
space and answers to queries in time 0(log( '°^™'~^°^" )) = 0{\og{ ^^^^ )) = 0(1). By setting 
K = |p^g|^g„ ; the data structure built on TZ takes O(r;,^"^^/'°siog") _ Q(^ji^+oW"^ space and 

answers to queries in time 0(log( ^°^"^~'°^"' )) = O ( log ( ^ 7ogl!°^ " ) ) = O (log log log n). Finally 

log log n 

by setting /t = 1, the data structure built on TZ takes 0{n) space and answers to queries in 
time 0(log( '°g";'°g" )) = 0(log((Oi - l)logn)) = O(loglogn) 

4.8 Implementing as a qf*-heap each ith bucket with endpoints in TZ 

Corollary 2 (Cor. 3.2, [30]). Assume that in a database of n elements, we have available 
the use of pre- computed tables of size o(n). Then for sets of arbitrary cardinality M <n, it 
is possible to have available variants of q* -heaps using 0{M) space that have a worst-case 
time of 0{1 + ]^^) for doing member, predecessor, and rank searches, and that support 
an amortized time 0(1 + -MiLL) insertions and deletions. 

^ log log n' ■' 

To apply Corollary [2] in our case, the database is the dynamic file X with n elements, 
so for our purposes it suffices to use pre-computed table of size o{n). Also, the sets of 
arbitrary cardinality M in our case are the p buckets [rj,rj_|_i) implemented as g*-heaps, 
so by Lemma [1] here M = 04 log n with O4 = 0(1). It follows that the worst-case time 
of each g*-heap is 0(1 + ]^^) = 0(1 + ^°^og ij"^"^ ) = 0(1) and the space per g*-heap is 
O (M) = O (O4 log n) = O (log°n) . 

4.9 0(1) whp amortized time 

The built time of DS is 0(n^+^), while each update step t = 1, . . . , il{n^^^) takes 0(1) time 
whp. It is straightforward that the amortized whp time is 0(1). 



4.10 Q(a/ , f " ) worst-case time 

^ Y log log n ' 

In order to get worst-case guarantees and ensure correctness of the data structure irrespective 
of number of keys insrted in the data structure, we will use two dynamic predecessor search 

data structures [5] with worst case 0(^ E^k^) Q^^ry and update times and linear space, 
see App. [El 
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A Unknown continuous (/i, /2)-smooth distributions 



Definition 2 ( [3]I18| ). Consider an unknown continuous probability distribution over the 
interval [a, b] with density function fi{x) = fi[a,b]{x). Given two functions fi and f2, then 
fi{x) = fj,[a,b]{x) is (/i, /2)-smooth if there exists a constant (3, such that for all Ci, 02,03, 
a < Ci < C2 < €3 < b, and all integers n, it holds that 

M2{n) 



rC2 

nX e [C2 - ^|^,C2] |C1 < X < C3] = / /i[ci,C3]( 



x)dx < 



n 



where /i[ci, c^{x) = for x < Ci or x > C3, and /x[ci, C3](x) = fi{x)/p for Ci < x < C3 where 
p = J^^ n{x)dx. 



B More on the preprocessing and operational phase 

Remark 1. All the p < n keys of the set TZ defined in ([2]) are stored -once and for all 
operational phase- into the sDS (which essentially is a VEB structure exactly as described in 
[22| Lem. 17]) and are not subject to any of the update steps: these keys are fixed. Their sole 
purpose is to land in 0(1) time each query request to the dynamic part of the DS which is 
a sequence of buckets being implemented as g*-heaps. The sorted file A" in ([1]) is considered 
as given, similar as P|18fl5] for real keys of infinite length and as [TB] for discrete keys. 
The p sampled a(n)n-apart keys in file 7?. in is a similar preprocess as P|18fl5j where 
from random file X a different sample was taken for their corresponding tuning reasons. 
Note that our robust operational phase lasts as long as f]{n^^^) time compared to 0{n) 
in [3|18|15j . The purpose of this is to cancel out the 0{n^^^) time to built the static FID 
and induce low amortized cost. Recall the infamous static structure in [5], Th. 4.5] requires 
time to built, but its dynamic version, although it is possible to built in 0{n) time, 
its running time is pumped up by a log log n factor [5i Cor. 4.6], ruling out any attempt to 
beat IS time. Finally, although the built time in [3p8] is 0(n), it assumes a rea/- RAM of 
infinite machine word length, which is different from the finite word RAM considered here. 
For example, in rea/-RAM each infinite real word occupies just 1 memory cell and real worlds 
can be processed in 0(1) time. 

Remark 2. As soon as a given query wrt key y E U is driven by the FID of TZ defined in 
([2]) towards to the appropriate iyth bucket being implemented as a g*-heap, the query is 
answered by the corresponding g*-heap built-in function in 0(l)-time. That is, all the keys 
of file X are stored in the buckets and only the content of each bucket is subject to any 
updates. 

Remark 3. This is a simple balls to bins (b2b) game which proves that the load per bucket 
remains O(logn) per update step: just prove that each g*-heap bucket lies on a subset of U 
with O (^^^) probability mass. 
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C Proof of Theorem [2] 



• First, we bound by n*"^ = n'~'^^^ the total number of equal parts of V, in a way that each 
part is expected to receive < logn random input keys per update step t = 1, . . . , i7(n^+^), of 
the operational phase (Sect. I3.ip . In other words, we want each such tiny V part to contain 
< probability mass from distribution V. 

Below we construct V recursively. Note that the endpoints of each such V part are 
obtained deterministically, depending only on the given parameters /i,/2 defined in 
Thus, Property [U governs the statistics of each V part per update step t = 1, . . . , i7(n^+^). 
Property [2] implies that per update step t = 1, . . . , f]{n^~^^) of the operational phase, for the 
current number rit of stored keys in file X it holds: 

nt < C^^^n = V (5) 

Initially, V partitions the universe \J = [0, . . . ,2l"'l] into /i(z^) > fi{nt) equally sized parts, 
with /i as in (jl]) and as in Smoothness (Def. [T]) and Eq. yield that each such 

V part gets < /jMiii < /jMll probability mass ((jl]) implies /2 is increasing wrt n) per 
update step t, with [3 a constant (Def. [1]) depending only on the particular characteristics of 
distribution P. Hence, during each update step t, each V part expectedly gets < (3^^^^ xrit = 
iSu" of the rit < u keys (by ([5])) currently stored in X. More accurately, the number of input 
keys distributed on any such V part has exponentially small probability of deviating from 
its expectation, thus, also from getting higher than its upper bound For simplicity and 
without loss of generahty we will not take into account constant (3 (Def. [1]). Subsequent 
deterministic partitioning is applied recursively within each such V part, until we reach a 
sufficiently small V part with probability mass as low as possible in order for its expectation 
to be < logn (recall n is \X\ as operational phase starts. Sect. 13. ip per update step t = 
1, . . . , i7(n^^'^). Let h be the number of such recursive partitions, then, it suffices: 

f lnln(n) \ f Inln(n) ^ 

z."" <\ogn^h< ^^^^ < \ '"("^ / = - logjln(n)) (6) 

In a In a 

where the strict inequality follows from (|5]). 

Remark 4- Note that ([6]) implies that each such tiny V part gets an arbitrary but < prob- 
ability mass q{n) wrt the unknown P distribution, with = |X| at the start of operational 
phase. 

Bellow we upper bound the total number of V parts induced until the h recursions grow up 
to the value of (Q. 

In the 1-st partition of the universe U, the number of parts is /i(i^) = /i 
u'^""^. In the 2-nd recursive partition, each part will be further partitioned into /i(z^") = 

subparts. In general, in the {i + l)-th recursive partition the 
subrange will be divided into /i(z^"') = (z/"')'^ = u"'""^ subparts. Taking into account Eq. (Q, 
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in the final level of recursive partition, the total number jP] of subparts is 



\v\ = Uf 



i=0 



1 



Ui"")' < n 



■7 = jyEi=o" -7 = ^7 „_i 



j=0 



i=0 



IP I < Cr^aSnT^ 



(7) 



with z/ as in ([5]) and a as in (|4]). Hence, the total number of bits needed for each such final 
part of V is at most 



log(|P|) 



7 



1 — a 



log n + log ( CjJiaS ) < Ci log n 



• Finally, we show that the probability that, any V part amongst the n'-'^ many depicted by 
([7]) receives > C-p x logn input keys, approaches to as n grows large, with Ci&C-p = 0(1). 
Without loss of generality, we compute the probability that a final V part with arbitrary 
probability measure q{n) (with g(n) dominated by the bounds of by Rem. H]) contains a{n) 
input keys. This probability equals 



n 



g(n)"(")"(l-g(n))(^-°("))" 



a(n) 



1 — q{n) 
1 — a{n) 



l—a{n) 



(9) 



Remark 5. The expression on the right of (|9]) is asymptotically equal to the expression on 
the left if we use Stirling's approximation n\ ~ (^) ylTrn and ignore inverse polynomial 
multiplicative terms. Expression ([9]) is a convex function of two variables {q{n) and a{n)) 
and achieves its maximum when q{n) = a{n). 

Note that the probability that a fixed V part gets load of keys from X that deviates to any 
higher value than a{n)n = (1 + 5) logn = C-p logn from its expected load < logn (Rem. E]) 
is at most: 



n 



a(n] 



a(n) 



1 — q{n) 
1 — a{n) 



l—a(n) 



— ^ 0, n — 7- oo 



(10) 



since all the higher load values than a{n)n = (1 + 5) logn = C-p logn are < n many, in 
addition, each such higher (more deviating) value occurs with probability at most the one 
in (jo]). In addition, there are n*^^^-* final parts (by Eq. ((Tj)) in V, so the union bound yields: 



n 



Oil) 



a{n) 



a{n) 



q{n) 



an 



l—a(n) 



— )■ 0, n — )■ cxo 



:iii 



vanishing probability that any tiny V part gets any higher load from X than a{n)n 
(1 + 5) log n = C-p log n. 
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D Proof of Theorem [3] 



Consider the ith part Jj of the universe U defined by the partition V-ji, i ^ [p]- Let qi{n) the 
total probabihty mass spread by distribution P over all keys of U that lie in part /j, that is, 
= "l^yGii ^(y)- Now, let us assume the bad scenario that the probability mass qi{n) of 
this part /j is either qi{n) = u{a{n)) or qi{n) = o{a{n)). According to this bad scenario, part 



li of V-ji has a given probability mass qi{n) 



' C4, log n 'i 



or qi{n) = o C""^'"^" ) and contains 



C4logn consecutive keys from X. But, the probability of this bad scenario wrt part Jj of V-r, 
is governed by the Binomial distribution: 



n 



a{n)n 



a[n] 



a{n) 



1 - qijn) 
1 — a{n) 



l—a{n) 



^ 0,n ^ c^2) 



which by Remark [5] vanishes exponentially in n for qi{n) = u{a{n)) or qi{n) = o{a{n)). Note 
that there are p < n possible realizations of such bad scenaria (possible blocks of C4 log n 
consecutive keys) hence the union bound gives: 



n X 



max 

ie[p] I \ a(n 



a(n) 



1 - qijn) 
1 — a(n) 



l—a{n) 



^0, 



(13) 



which also vanishes when each qi{n) = u{a{n)) or qi{n) = o{a{n)). Thus, whp all parts 
of Vtz are spread on corresponding subintervals of the universe U with probability mass 
0{a{n)) =eC- 



/Cif C4 log n ^ 

n 



E Proof of 0(J , ) worst -case time 

^ Y log log n ' 

1. In the first predecessor data structure which we note by B\ we initially insert all the 
numbers in interval [l,p]. Then Bx will maintain the set of non empty buckets during 
the operational phase. The role of is to ensure the correctness of predecessor queries 
when some buckets get empty. 

2. The second predecessor data structure which we note by B2 is initially empty. The role 
of B2 is to store all the overfiowing elements which could not be stored in the g*-heaps 
corresponding to their buckets. 

handling overflows. In addition, we maintain an array of p counters where each counter Cj 
associated with bucket i stores how many keys are stored inside that bucket and assume that 
the capacity of the g*-heap associated with each bucket is C = 0{logn). Now at initialisation, 
all the buckets have initial load 0{logn) and all the keys of any bucket i are stored in the 
corresponding g*-heap. Then at operational phase the insertions/deletions of keys belonging 
to a given bucket are directed to the corresponding g*-heaps unless it is overfiown. More 
precisely when trying to insert a key x into a bucket number i and we observe that Q > C, 
we instead insert the key x in i?2. Symmetrically, when deleting a key x from a bucket i when 
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Ci > C proceeds as follows : If the key x is found into the g*-heap, we delete it from there 
and additionally look into B2 for any key belonging to bucket i and transfer it to g*-heap of 
bucket i (that is delete it from B2 and insert it into the g*-heap). If the key x to be deleted is 
not found in the g*-heap, we instead try to delete the key from i?2- By using this strategy we 

ensure that any insertion/deletion in any bucket takes at worst 0{^J^^^^) time and still 

0(1) time whp. Queries can also be affected by overflown buckets. When the predecessor of 
a key x is to be searched in an overflown bucket i (that is when a predecessor search lands 
into bucket i with q > C) and the key is not found in the corresponding gf*-heap, then the 

key X is searched also in B2 in time 0(y^ l^^^)- event of an overflowing bucket is 

expected to be very rare the performance of queries is still 0{1) time whp. 

handling empty buckets. The data structure Bi will help us handle a subtle problem which 
occurs when we have some empty buckets. Suppose that we have a non empty bucket i 
followed by a range of empty buckets [i + 1, j]. Then the answer to any predecessor search 
directed towards any bucket /c G [i + 1, j] should return the largest element in bucket i. Thus 
in the highly unlikely case that a predecessor search lands in an empty bucket k (which is 
checked by verifying that = ) we will need to be able to efficiently compute the largest 
non empty bucket index i such that i < k and this can precisely be done by querying Bi 
for the value k which obviously will return i as Bi is a predecessor data structure which 
stores precisely the index of non empty buckets and i is the largest non empty bucket index 

preceding k. This last step takes C>(y/j^^) = 0{^^^^) t ime. What remains is to show 
how to maintain Bi. For that we only need to insert a bucket i into Bi whenever it gets non 
empty after it was empty or to delete it from Bi whenever it gets empty after it was enon 
empty and those two events (a bucket becoming empty or a bucket becoming non empty) 
are expected to be rare enough that the time bound for any insertion/deletion remains 0{1) 
whp. 
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